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Abstract 


The  formulation  of  the  decision  making  of  a  failure  detection 
process  as  a  Bayes  sequential  decision  problem  (BSDP)  provides 
a  simple  conceptualization  of  the  decision  rule  design  problem. 

As  the  opcimal  "ayes  rule  is  not  computable,  a  methodology  that 
is  based  on  Che  Baysian  approach  and  aimed  at  a  reduced. computa¬ 
tional  requirement  is  developed  for  designing  subopeimal  rules. 

A  numerical  algorithm  is  constructed  to  facilitate  the  design  and 
performance  evaluation  of  these  subopeimal  rules.  The  result  of 
applying  this  design  methodology  to  an  example  shows  that  this 
approach  is  a  useful  one. 
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1.  INTRODUCTION 

The  failure  detection  and  identification  (FDI) 
process  involves  monitoring  the  sensor  measurements 
or  processed  measurements  known  as  che  residual  [1] 
for  changes  from  its  normal  (no-fail)  behavipr.  Re¬ 
sidual  samples  are  observed  in  sequence.  If  a  failure 
is  judged  to  have  occurred  and  sufficient  information 
(from  che  residual)  has  been  gathered,  the  monitoring 
process  is  stopped.  Then,  based  on  Che  pasc  obser¬ 
vations  of  residual,  an  idencif icacion  of  the  failure 
is  made.  If  no  failure  has  occurred,  or  if  the  in¬ 
formation  gathered  is  insufficient,  monitoring  is  not 
interrupted  so  that  further  residual  samples  may  be 
observed.  The  decision  to  interrupt  che  residual- 
monitoring  to  make  a  failure  identification  is  based 
on  a  compromise  between  the  speed  and  accuracy  of  che 
detection,  and  che  failure  identification  reflects 
the  design  tradeoff  among  che  errors  in  failure  clas¬ 
sification.  Such  a  decision  mechanism  belongs  to  the 
extensively  studied  class  of  sequential  tests  or  se¬ 
quential  decision  rules.  In  this  paper,  we  will  em¬ 
ploy  the  Bayesian  Approach  (2]  to  design  decision 
rules  for  FDI  systems. 

In  Section  2,  we  will  describe  the  Bayes  formu¬ 
lation  of  che  FDI  decision  problem.  Although  the 
opciaal  rule  Is  generally  not  computable,  the  struc¬ 
ture  of  che  Bayesian  approach  can  be  used  to  derive 
practical  subopeimal  rules.  We  will  discuss  Che  de¬ 
sign  of  subopeimal  rules  based  on  Che  Bayes  formula¬ 
tion  in  Section  3.  In  Section  4,  we  will  report  our 
experience  wich  this  approach  to  designing  decision 
rules  through  a  numerical  example  and  simulation. 


2.  THE  3AYESIAN  APPROACH 

The  BSDP  formulation  of  che  FDI  problem  consists 
of  six  elements: 

1)  0:  the  set  of  states  of  nature  or  failure 
hypotheses.  An  element  3  of  0  may  denote  a  single 
cype  i  failure  of  size  v  occurring  ac  time  r(9- 

(i,T ,v) )  or  the  occurrence  of  a  set  of  failures  (pos¬ 
sibly  simultaneously) ,  i.e.  9*{  (i^ ,  ,v^) . (ln,rn, 

vn)}.  Due  to  the  infrequent  nature  of  failure,  we 
will  focus  on  Che  case  of  a  single  failure. 

In  many  applications  it  suffices  to  jusc  identify 
the  failure  type  without  estimating  the  failure  size. 
Moreover,  it  is  often  true  that  a  detection  system 
based  on  (i,r ,\T)  fi  r  some  appropriate  ~  can  also  de¬ 
tect  and  identify  che  cype  of  the  failure  (i,r,v)  for 
v>v.  Thus,  we  may  use  (  i , t ,  vj  to  represent  (i,r). 

In  the  aircraft  sensor  FDI  problem  [3],  for  inscance, 
excellent  results  were  obtained  using  this  approach. 
Now  we  have  the  discrete  nature  set 

3  -  (  (  i ,  t)  ,  i-1 . M,  t-1.2 . ) 

where  we  assume  chere  are  M  different  failure  types 
of  incerest. 

2)  u:  the  prior  probability  mass  function  (?MF) 
over  Che  nature  sec  3.  This  PMF  represents  the  a 
priori  Information  concerning  the  failure,  i.e.  how 
likely  it  is  for  each  cype  of  failure  to  occur,  and 
when  is  a  failure  likely  to  occur.  Because  this  in¬ 
formation  may  not  be  available  or  accurrate  in  some 
cases,  che  need  co  specify  u  is  a  Jrawback  of  the 
Bayes  approach  for  such  oases.  Nevertheless,  we  will 
see  that  1c  can  be  regarded  as  a  parameter  in  the  de¬ 
sign  of  a  Sayes  rule. 

In  general,  j  njv  be  arbitrary.  Hero,  -c  a ssu-o 
che  underlying  failure  orocess  his  two  pricer:  ie,  : 


i) 

the 

M  fa 

i  Lures 

j  t  '  a  r  * 

ind 

epoc'icr.t  of 

one  mother. 

and  i  1 

)  the 

occur: 

■unco  or 

o  i  c  h 

:  ;i  i  lure  i 

So 

rnou 

111  ? 

rocoss 

with  ■  *  ; 

: :  e  .n 

O  p  V  T  P  ut  J 

r  -  j  • 

So 

rnou 

1  ti  ? 

roce 

('orr.s" 

.1.:  t  >:  ? 

>  '.on  ?roc- 

•?:> 

j  m 

.ont 

inuC'ia 

t  i  *  •  •.  4 

l  c 

:  —  ie  t 

:  or  :  j  :  1  ;r«-j 

;  n 

p  .1  y 

s  tea  i 

torpor 

■_  ;  :i 

.  »•  r  i  •' 

.1  '  \-Z  .  ' 

1'HIS  PAGE  15  BESI  QUALITY  PRACIICABI^ 
k’tu  ;J.  cut  Y  FUrw'loHED  TO  DDC  ,,, - « 


.~A-V\ 


describes  a  large  class  of  failures  (such  as  sensor 
failures)  while  providing  a  simple  approx inac ion  for 
the  others.  It  is  straightforward  to  show  that 


u(i,T)»o(i)p(l-p) 


i-l,...,M,  t-1,2,..., 


P-1  -  II  (1-P,) 
j-l  3 

_1  Bt  -11 

o(i)-p.(l-p  )  t  0  . (1—0 , )  l]  1 
j-l  J  3 

The  parameter  p  may  be  regarded  as  the  parameter  of 
the  combined  (Bernoulli)  failure  process  -  the  oc¬ 
currence  of  the  first  failure;  c(i)can  be  interpreted 
as  the  marginal  probability  that  the  first  failure 
is  of  type  i.  Note  that  the  present  choice  of  u  in¬ 
dicates  the  arrival  of  the  first  failure  is  memory¬ 
less.  This  property  is  useful  in  obtaining  time- 
invariant  suboptimal  decision  rules. 

3)  t>(k) :  che  discrete  set  of  terminal  actions 
(failure  identifications)  available  to  the  decision 
maker  when  che  residual-monitoring  is  stopped  at  time 
k.  An  element  6  of  (J(k)may  denote  the  pair  (j,t), 
i.e.  declaration  of  a  type  j  failure  to  have  occurred 
at  time  t.  Alternatively,  6  may  represent  an  iden¬ 
tification  of  the  j-th  failure  type  without  regard 
for  the  failure  time,  or  it  nay  signify  the  presence 
of  a  failure  without  specification  of  its  type  or 
time,  i.e.  simply  an  alarm.  Since  the  purpose  of  FDI 
is  to  detect  and  identify  failures  that  have  occurred 
P(k)  should  only  contain  identifications  that  either 
specify  failure  times  at/before  k,  or  do  not  specify 
any  failure  time.  As  a  result,  the  number  of  ter¬ 
minal  decisions  specifying  failures  cimes  grows  with 
k  while  the  number  of  decisions  not  specifying  any 
time  will  remain  the  same.  In  addition,  D(k)  does 
not  include  the  declaration  of  no  failure,  since  the 
residual-monitoring  is  stopped  only  when  a  failure 
appears  to  have  occurred. 

4)  L(k;9,6):  the  terminal  decision  cost  func¬ 
tion  at  tine  k.  L(k;6,5)  denotes  the  penalty  for 
deciding  SeP(k)  at  tine  k  when  the  true  state  of 
nature  is  8«(i,r).  It  is  assumed  to  be  bounded  and 
non-negative  and  have  the  structure: 


ru 

(k;(t,r),i)-^ 


L((i,T),S)  x<k,  «cO(k) 


t>k  dcC(k) 


where  LO.S)  is  the  underlying  cost  function  that  is 
independent  of  k;  Lp  denotes  Che  penalty  lor  a  false 
alarm,  and  it  may  be  generalized  to  be  dependent  on 
i .  It  is  only  meaningful  for  a  terminal  action 
(identification)  that  indicates  the  correct  failure 
(and/or  time)  to  receive  a  lower  decision  cost  than 
one  chat  indicates  the  wrong  failure  (and/or  tine), 
'•'e  further  assume  Chat  the  penalty  due  to  an  incor¬ 
rect  identification  of  the  failure  time  is  only  de- 
rsoaenc  on  che  error  of  such  an  identification.  That 
is  for  :*<J,t), 

UU,T).(j,c))  -  Ui,j,(r— t)> 

•■no  .‘or  i  with  no  tire  specification 

L((  i.r)  L(i,M 


5)  r(k)  :  if r.-d  i.-.ensional  residual  (observa- 

■  n  •  -c  .c-ee.  •'«  ••■'•all  let  p  <"  r  ( 1 ) . r(k)  :  (i,t)) 

rate  joint  v-uuit  tonal  density.  Assuming 


that  the  residual  is  affected  by  che  failure  in  a 
causal  manner,  its  conditional  density  has  Che  prop¬ 
erty 

P(r(l) . r(k)|(i,T))=p(r(l) . r(k)|(0,-)) 

i“l . M,  t  >k 

where  (0,-)  is  used  to  denote  the  no-fail  condition. 
For  che  design  of  suboptimal  rules,  we  will  assume 
that  the  residual  is  an  independent  Gaussian  sequence 
with  V(cxm  matrix)  as  the  time-independent  covariance 
function  and  g^(k-T)  as  the  mean  given  chat  the  fail¬ 
ure  (i,r)  has  occurred.  With  the  covariance  assumed 
to  be  the  same  for  all  failures,  the  mean  function 
g  (k-r),  characterizes  che  effect  of  the  failure 
(i.t),  and  it  is  henceforth  called  che  signature  of 
(1,t)  (with  g.(k-t)*0,  for  i**0,  or  t>k).  We  have 
chosen  to  study  this  type  of  residuals  because  its 
special  structure  facilitates  the  development  of  in¬ 
sights  into  the  design  of  decision  rules.  Moreover, 
Che  Gaussian  assumption  is  reasonable  In  many  problems 
and  has  met  with  success  in  a  wide  variety  of  appli¬ 
cations,  e.g.,  [3]  [4].  (It  should  be  noted  that  che 
use  of  more  general  probability  densities  for  the 
residual  will  not  add  any  conceptual  difficulty.) 

6)  c(k,(i,')):  the  delay  cost  function  having 

the  properties: 


c(k,(i,x)) 


c(i,k-T)  >  0  T<k 


c(i,k1-r)>c(i,k2-T) 


k^k^T 


After  a  failure  has  occurred  at  t,  there  is  a  penalty 
for  delaying  the  Certninal  decision  until  tine  k>T 
with  the  penalty  an  increasing  function  of  the  delay 
(k-t).  In  the  absence  of  a  failure,  no  penalty  is 
imposed  on  the  sampling.  In  this  study  we  will  con¬ 
sider  a  delay  cost  function  chat  is  linear  in  the 
delay,  i.e.  c(i,k--)»c(i) (k-r) ,  where  c(i)  is  a  posi¬ 
tive  function  of  the  failure  type  i,  and  may  be  used 
to  provide  different  delay  penalties  for  different 
types  of  failures. 

A  sequential  decision  rule  naturally  consists  of 
two  parts:  a  stopping  rule  (or  sampling  plan)  and  a 
terminal  decision  rule.  The  stopping  .rule,  denoted 

by  #»(?(0)  ,d(l;r(l>) . f(k;r(l) . r(k)),...)  is  a 

sequence  of  functions  of  the  observed  residual  sam¬ 
ples,  with  $(k;r(l) . r(k))*l,  or  0.  When 

p(k;r(l) , . . . ,r(k) )«1,  (0),  residual-monitoring  or 
sampling  is  stopped  (continued)  after  the  k  residual 
samples,  r(l) , . . . ,r (k)  are  observed.  Alternatively, 
the  stopping  rule  may  be  defined  by  another  sequence 
of  functions  f»('ii(0)  ,4/(1  ;r(l) ) , . . .  ,«(k;r(l) , . . . , 

r(k)),...),  where  i>(k;r(l) . r(k))«l  (0)  indicates 

that  residual-monitoring  has  been  carried-  on  up  to 
and  including  time  (k-1)  and  will  (not)  be  stopped 
after  time  k  when  residual  sanples,  r(l) , . . . ,r(k)  are 
observed.  The  functions  t  and  V  are  related  to  each 
other  in  the  following  wav 


ij.(k;r(l) . r(V.))  = 

k-1 


: ( k : r ( 1 ) . r(k))  . 

C-;<s,r(l) . r (s) )  ] 


with  i(0)-:(0) . 

The  terminal  decision  rule  is  a  sequence 

functions,  D-(d < 0) ,d (1 ;r fl) ) . d(k:r(l) . 

...),  rapping  residual  s.-.-ples,  r  ( 1 ),...,  r  (k) 
the  terminal  action  set  ?{k) .  The  function 
4(k;r(l) .... ,r(k))  r.  :: -Seats  the  decision  ru] 
to  arrive  at  an  action  ( icent if ication)  if  sat 


or 

r  ( k ) ) , 
into 


Ui*  .'/iAui' 


/Hu*  Cw: 


V  — e  *  - 


is  stopped  at  tine  k  and  the  residual  samples,  r(l), 
. . . ,r(k)  are  observed. 

As  a  result  of  using  the  sequential  decision 
rule  (J,D),  given  (i,t)  is  the  true  state  of  nature, 
the  tocal  expected  cost  is: 


UnEU,*).(*,D)l-4  S.  T{*(k;r(l) . r  (k) )  [c  (k,  (  i,  t)  )  + 

u  k-0  L,T 

L(fc; (i, t) ,d(k;r(l) , . . . ,r(k) } ) ] } 


the  3SD?  is  defined  as:  determine  a  sequential  deci¬ 
sion  rule  (J*,D*)  so  that  the  sequential  3ayes  risk 
lis  is  minimized,  where 

M  » 

U,(>,0)*E’Jo[(i,t),(},D)K  -  w(i,t)U0[(i,r), 0.0)1 

i-1  t-1 

is  called  che  Bayes  Sequential  Decision  Rule 
(3SDR)  with  respect  co  a,  and  it  is  optimal  in  the 
sense  thac  it  Qiniaizes  the  sequential  Bayes  risk. 

In  the  following  we  will  discuss  an  interpreta¬ 
tion  of  the  sequential  risk  for  che  FDI  problem.  Let 
us  define  the  following  notacior. 

T-1 

?fCt>*  1  E0  _j/(k;r(l) . r(k>) 

k»l 


5(k,:)-{[r(l),...,r(k)]: 

w(k;r(l) .... , r(k)-l,d(k,r(l) , ... , r(k))»3>,  SzD 


relationships  anong  the  various  performance  issues. 

The  advantage  of  the  indirect  approach  is  that  only 
the  tocal  expected  cosc  inscead  of  every  individual 
performance  issue  needs  to  be  considered  explicitly  in 
designing  a  sequential  rule.  The  drawback  of  che  ap¬ 
proach,  however,  lies  in  the  choice  or  a  set  of  appro¬ 
priate  cost  functions  (and  sometimes  the  prior  distri¬ 
bution)  when  che  physical  problem  does  not  have  a  nat¬ 
ural  sec,  as  it  doesn't  in  general.  In  this  case,  the 
Bayes  approach  is  most  useful  with  the  cost  functions 
(and  the  prior  distribution)  considered  as  design 
parameters  that  may  be  adjusted  to  obtain  an  acceptable 
design. 

The  opcinal  terminal  decision  rule  D*  can  be  eas¬ 
ily  shown  to  be  a  sequence  of  fixed-sample-size  tests 
[2).  The  determination  of  the  optimal  stopping  rule 
4*  is  a  dynamic  programming  problem  [1).  The  immense 
storage  and  computation  required  make  5*  impossible  to 
compute,  and  subopcimal  rules  mst  be  used. 

Despite  Che  impractical  nature  of  its  solucion, 
the  3SDP  provides  a  useful  framework  for  designing 
subopcimal  decision  rules  for  che  FDI  problem  because 
of  its  inherenc  characteristic  of  explicitly  weighing 
che  tradeoffs  between  detection  speed  and  accuracy  (in 
terms  of  the  cosc  structure).  A  sequential  decision 
rule  defines  a  set  of  sequential  decision  regions 
S(k,5),  and  the  decision  regions  corresponding  to  the 
BSDR  yield  Che  minimum  risk.  From  this  vantage  point, 
the  design  of  a  suboptimai  rule  can  be  viewed  as  che 
problem  of  choosing  a  sec  of  decision  regions  that 
would  yield  a  reasonably  small  risk.  This  is  the  es¬ 
sence  of  the  approach  to  subopcimal  rule  design  Chat 
we  will  describe  next. 


?  '5(k,£)ji,T}«  /  P(r(l),...,r(k)|i,T)dr(l)...dr(k) 
S(k.o) 


3 .  SUBOPTIMAL  RULES 


t(i.T)-:  (k-T)(l-??(T))*1E.  *(k;r(l) . r(k)) 

k-T  “  ' 

?( (i.c) ,£)*  Z  ?c{S(k,5)|i,T}(l-?F)_1 
k=r 

where  PF(t)  is  the  probability  of  stopping  to  declare 
a  failure  before  che  failure  occurs  at  T,  i.e,  che 
probability  of  false  alarm  when  a  failure  occurs  at 
:ine  t;  D  is  the  sec  of  terminal  actions  for  all  times; 
S(k,5)  is  the  region  in  che  sample  space  of  the  first 
k  residuals  where  Che  sequential  rule  (i,D)  yields  che 
terminal  decision  3.  Clearly,  che  S(k,5)'s  are  dis¬ 
joint  secs  with  respect  co  both  k  and  5.  The  expres¬ 
sions  s(i,x)  and  P((i,t),S)  are  che  conditional  ex¬ 
pected  delay  in  decision  (i.e.  stopping  sampling  and 
making  a  failure  identification)  and  che  conditional 
probability  oc  eventually  declaring  a,  given  a  type  i 
failure  has  occurred  at  time  t  and  no  false  alarm  has 
bean  signalled  before  chis  time  respectively. 

?((’-, t), 5)  is  Che  generalized  cross-detection  proba¬ 
bility.  Finally,  the  sequential  Bayes  risk  U3  can  be 
written  as 

M  » 

, D) -r  Z  u(i,r)a-P_(T)  +  (l-?_(T))[c(i)c(i,c)  + 

»  ...  r  r  F 

1  L((i,r) , :)?((i,r)  ,3) ] )  (1) 

i-iV 


The  Sliding  L’indow  Approximation 

The  immense  computation  associated  with  the  BSDR 
is  partly  due  to  the  increasing  number  of  failure 
hypotheses  as  time  progresses.  The  remedy  for  chis 
problem  is  the  use  of  a  sliding  window  to  limit  the 
number  of  failure  hypotheses  to  be  considered  ac  each 
time.  The  assumption  made  under  the  sliding  window 
approximation  is  thac  essentially  all  failures  can  be 
detected  within  W  cime  seeps  after  they  have  occurred, 
or  thac  if  a  failure  is  not  detected  within  chis  time 
it  will  not  be  dececced  in  che  fucuru.  Here,  the  win¬ 
dow  size  W  is  a  design  parameter,  and  it  should  be 
chosen  long  enough  so  that  detection  and  identification 
of  failures  are  possible,  but  short  enough  so  thac 
Implementation  is  feasible  [1].,.  w 

The  sliding  window  rule  (i",d'  )  divides  the  sample 
space  of  the  sliding  window  of  residuals  { r (k-V+1) , 
...,r(k)},  or  equivalently,  the  space  of  veccors  of 
posterior  probabilities,  likelihood  ratios,  or  log 
likelihood  ratios  (1)  of  che  sliding  window  of  failure 
hypotheses  into  disjoint  cime-independenc  sequential 
decision  regions  {Sq,S^  , .  .  .  ,3^; .  Because  the  residuals 
are  assumed  to  be  Caussian  variables,  it  is  simpler  Co 
work  with  L  (which  is  related  to  1  by  a  constant): 

i(k)-((.g(k) . (k)l' 

where 


Equation  (1)  indicates  that  the  sequential  3ayes 
risk  is  a  weighted  cc.-.b mat  ion  of  the  cond clonal  false 
.tier-  probability,  expected  delay  co  decision  inJ 
t  r’  ;  .-detect  ion  probabilities,  and  che  optima’,  sequeu- 
tci'.  r  :le  (f*.D*)  minimize?:  such  a  combination.  From 
tats  vantage  point,  the  coat  functions  <L  lnd  c)  and 
trior  Ji-str  ih-.t  inn  ( pro  vide  for  Che  weighting, 

specify  tag  the  tradeoff 


L-(k)  =  [  L(k;  l.r) . Uk:>!,:)]  * 

t(k;i,r)«:  g  !  f  s )  1  r  ('■■-- r-s  1 

s’O  1 
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L(k)cSg,  and  we  will  proceed  to  take  one  more  obser¬ 
vation  of  the  residual.  The  Bayes  design  problem  is 
to  determine  a  set  of  regions.  {Sq.sJ,  ...  .Sjjj,)  that  min¬ 
imizes  the  sequential  risk  U ^ ( { S j } ) .  This  represents 
a  functional  minimization  problem  for  which  a  solution 
is  generally  very  difficult  to  determine.  A  simpler 
alternative  to  this  problem  is  to  constrain  the  deci¬ 
sion  regions  to  cake  on  special  shapes,  {Sj_(£)},  that 
are  paramecerized  by  a  fixed  dimensional  vector,  f, 
of  design  variables.  Then  the  resulting  design  pro¬ 
blem  involves  the  determination  of  a  set  of  parameter 
values  f*  that  minimizes  the  risk  U^(f).  We  will 
focus  our  attention  on  a  special  set  of  parametrized 
sequential  decision  regions,  because  they  are  simple 
and  chey  serve  well  to  illustrate  that  the  Bayes 
formulation  can  be  exploited,  in  a  systematic  fashion, 
co  obcain  simple  suboptimal  rules  chat  are  capable  of 
delivering  good  performance.  These  decision  regions 
are: 

S(j ,c)»{l(k)  :  l(k; j ,t)>f (j  ,t)  , 

t"1(.NHm(k:j,t)-f(j,01>£"1(i,7)U(k;l,7)-f(i,:?), 

U.OfKj.e)}  (3a) 

S(0,-)-(l(k)  :  L(k;i,t)<f (i,r) , 

i»l,...,M,  t*0, ... ,W— 1 )  (3b) 

where  S(j,c)  is  Che  stop-to-declare  (j,k-c)  region  and 
S(0,-)  is  the  continue  region  (see  Fig.  1).  Generally 
the  c’s  nav  be  regarded  as  design  parameters,  but 
here,  s(j,c)  is  simply  taken  to  be  the  standard  de¬ 
viation  of  l(k,J,t). 

To  evaluate  U^f(f),  we  need_to  determine  the  set 
of  probabilities,  (Pr{ L(k)cS(j ,t) ,L(k-l) eS(0, -),...  , 

i(V)eS(0.-) ! i,r) .  k>W,  j«0,l,...,M,  t-0 . W-l}, 

which,  indeed,  is  the  goal  of  many  research  efforts  in 
the  so-called  level-crossing  problem  (5).  Unfortu¬ 
nately,  useful  results  (bounds  and  approximations  of 
such  probabilities)  are  only  available  for  the  scalar 
case  (6],(7],[S).  As  it  stands,  each  of  che  proba¬ 
bilities  is  an  integral  of  a  kMW-diroensional  Gaussian 
density  over  the  compound  region  S(0,-)x. . .xS(0,-) 
xS(j,c),  which,  for  large  kMU,  becomes  extremely  un¬ 
wieldy  and  difficult  to  evaluate. 

The  MV-dimensional  veccor  of  decision  statistics 
l(k)  corresponds  to  che  MW  failure  hypotheses,  and 
they  provide  the  information  necessary  for  the  simul¬ 
taneous  identification  of  both  failure  type  and  fail¬ 
ure  time.  In  most  applications,  such  as  the  aircraft 
3ensor  FDI  problem  [3]  and  the  detection  of  freeway 
traffic  incidents  (4),  where  the  failure  time  need  noc 
be  exolicitly  identified,  the  f  .lure  time  resolution 
power  provided  by  the  full  window  of  decision  statis¬ 
tics  is  not  needed.  Instead,  decision  rules  that 
employ  a  few  components  of  (.(k)  may  be  used.  The 
decision  rule  of  this  type  considered  here  consists 
of  sequential  decision  regions  that  are  similar  to 
(3)  but  are  only  defined  in  terms  of  M  components  of 
l  (k)  : 

:  L(k;  j  ,W-l)>f } 

c'1'j.'.:-l)!L(k;j,W-l)-fj)>c'1(i.W-l)(L(k,i,W-n-fjl  , 


3o*;  ^k)  ;  j*l . M;  (4b) 

-here  $;  is  the  stop-to-declarc-faf lure-j  region  and 
Sq  is  the  continue  region.  Tt  should  be  noted  that 
the  use  of  (4)  is  efi-ctive  if  cross-correlations  of 
signatures  >-ong  hypotheses  of  the  sa-e  failure  tvpe 
at  different  :  i-es  are  -mailer  than  those  among  hypo¬ 


theses  of  different  failure  types. 

The  risk  for  using  (4)  is 

Jgr?tV-1Ck)€Sj.S0(k-l)!0.-) 

+  r  r  p(i,t)  z  ‘f  [c(i)(k-T)+L(i,j)i 

i=l  r*l  k”max[W,t]  j*l 

x  PrfLy^  (k)cS^. ,  SQ(k-l) |i,t} 


5o(,‘c)’{Lv-i(k)£:So"--'Lw-i(u)£So} 

The  probabilities  required  for  calculating  the  risk 
are  given  by  the  recursion: 

p(Lw_1(k+l)|S0(k),i,T)  - 

[/  p(lv_l(k) |SQ(k-l) ,i,T)dLv_1(k)]_1 

x  °;  p(iw_1(k+i) !  l^oo  ,s0(k-i)  ,i,T)» 

P(Lv01(k)|S0(k-l),i,T)dL._1(k)  k>W  (5 

Pr{Lw_1(k)£Sj,  S0(k-l)!i,T>  -  Pr{50(k-l)|i,r)- 

/  pOv_1(k)!S0(k-l),i,r)dlv_1(k),  j-0,l,...,M  (6 


PrUw_1(W)£S,|i,T}  -  /  pOv_1(W)ii.T)dIw_1(W)  (7) 

J  Sj  •"  .. 

For  M  small,  numerical  integration  of  ( S)  —  ( 7 )  becomes 
manageable. 

Unfortunately,  the  transition  density, 

pa„.xck+i)Kw-i<fc>.Vk-i).i.7),  ret?uired  in  (5)  is 

difficult  to  calculate,  because  Ly_i(k)  is  not  a 
Markov  process.  In  order  to  facilitate  computation 
of  the  probabilities,  we  need  to  approximate  the 
transition  density.  In  approximating  the  required 
transition  density  for  i-',-_x(k)  we  are,  in  fact,  ap¬ 
proximating  the  behavior  of  Ly.j.  A  simple  approx¬ 
imation  is  a  Gauss-Markov  process  l(k)  that  is  defined 
by 

£(k+l)  -  A£ (k)  +  i (k+1) 

E(f (k)C ' (t) }  *  BB'u0(k-t) 

where  A  and  B  are  MxM  constant  matrices  and  5  is  a 
white  Gaussian  sequence  with  covariance  equal  to  the 
(MxM)  matrix  3B’.  The  reason  for  choosing  this  model 
is  twofold.  Firstly,  just  as  Ly-x(k),  t(k)  is 
Caussian.  Secondly,  t(k)  is  Markov  so  that  its  tran¬ 
sition  density  can  be  readily  determined.  In  order  to 
have  I(k)  behave  like  U,_^(k),  we  set  the  matrices  A 
and  B  and  the  mean  of  7.  such  that 


Ei.t< 

Kk>;-E.  iu.  , (k) } 

(8) 

Eo,-' 

1  (fc)i  ’  (V.)  '•  *E0f  J  Lv_1  (k)  L,'.^)  } 

(9) 

”0 ,  -  • 

i  fu 1’  (k-wi)  ;■  *r.0> .  ■;  u._1  (k)  i.,_1  (k+i) ) 

(10) 

That  is,  we  have  matched  the  marginal  density  and  the 
one-step  cross-covariance  of  i(k)  to  those  of  l,^(k). 
It  car.  be  shown  that  (S )-<10)  uniquely  specify 
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”1  “0 
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Moreover,  the  macrtx  A  is  stable,  i.e.  the  aagnicudes 
of  all  of  the  eigenvalues  of  A  are  less  chan  unity, 
and  3  is  invertible  if  Gq  or  Gy,  is  of  rank  M.  Be¬ 
cause  i  is  an  artificial  process  (i.e.  5  is  not  a 
direct  function  of  the  residuals  r(k))  i(k)  can  never 
be  implemented  for  use  in  (4) . 

Ve  nay  choose  ocher  Markov  approximations  of 
(’*)  4hac  macch  xhe  n-step  cross-covariance  (l<n<W) 
instead  of  eacching  the  one-step  cross-covariance  as 
in  (10).  The  suitability  of  a  criterion  for  choosing 
the  natrices  A  and  S,  such  as  (9)  and  (10),  depends 
directly  on  the  failure  signatures  under  consideration 
and  nay  be  examined  as  an  issue  separata  from  the 
decision  rule  design  problem.  Also,  a  higher  order 
Markov  process  may  be  used  to  approximate  Lw-l-  How¬ 
ever,  the  increase  in  the  computational  complexity 
-ay  negate  the  benefits  of  the  approximation. 

Mow  we  can  approximate  the  required  probabilities 
in  the  risk  calculation  as 


?r  :U._1(k)ssj  ,S0(k-l)  |  i,T}s!?rU(k)sS  .  ,S0(k-l)  |  i,t} 


j-0,1 . M  k>W 


and 


?r(l(k)e5j,  S0(k-l)|i,r) 


where  we  have  applied  the  same  decision  rule  to  l(k) 
as  L^Ck).  Therefore,  Sj  and  SqCx-I)  denoce  the 
decision  regions  and  the  event  ot  continued  sampling 
up  to  time  k  for  both  Ly_^  and  1.  Assuming  B_* 
exists,  we  have 

?(i(k+l)|S0(k),i,T)  -  [/  p(l(k)|S0(k-l),i,T)d/.(k)]'1 
s0 

x  !  p(5 (k+1)  -  U(k+l)-At(k))li,-) 

So 

?(?.(k)  is0(k-l),i,r)d;.(k),  k>W  (12) 

where  ?(l(k)'i,T)  is  che  Caussian  density  of  y(k) 
ur.ter  the  failure  (i,:).  Mow  the  integrals  (11)  and 
(12)  represent  more  tractable  numerical  problems. 

In  the  event  that  3  is  not  invertible,  the  tran¬ 
sition  iensity  is  degenerate  and  (12)  is  very  difficult 
to  evaluate.  Very  often  this  problem  can  be  circum¬ 
vented  by  bacch  processing  the  residuals.  Tivjt  is,  we 
-.ly  consider  the  mod  if  icd  residual  sequence:  r(k)  * 

'r'  .  k-v-l)  ,r'  fvk— /*2) . r'(vx)I'  for  some  batch 

.-0  with  <*1.2,...  is  the  new  time  index.  In 


using  r(k)  we  have  to  augment  the  signatures  as: 

ts<(0) . S_<  ( v— 1 )  ]  *  ,  i-l,...,M.  By  a  proper  choice 

of*v,  the  rank  of  Gq  can  be  increased  to  M  and  3  will 
be  invertible. 

Non-Window  Sequential  Decision  Rules 

Here  we  will  describe  another  simple  decision 
rule  thac  has  che  same  decision  regions  as  the  simpli¬ 
fied  sliding  window  rule  (4),  buc  che  vector  (a)  of  M 
decision  statistics  is  obcained  differently  as  follows 

z(k+l)  -  A  z(k)  +  3  r (k+1)  (13) 

where  A  is  a  constant  stable  MxM  matrix,  and  3  is  a 
>1x3  constant  matrix  of  rank  M.  Unlike  the  Markov 
model  l(k)  that  approximates  Ly_^(k),  z(k)  Is  a 
realizable  Markov  process  driven  by  che  residual.  The 
advantages  of  using  z  as  the  decision  statistic  are: 

1)  less  storage  is  required,  because  residual  samples 
need  noc  be  stored  as  necessary  in  Che  sliding  window 
scheme,  and  2)  since  z  is  Markov,  che  required  proba¬ 
bility  integrals  are  of  the  form  (11)  and  (12)  so  that 
the  same  integration  algorithm  can  be  directly  applied 
to  evaluate  such  integrals.  (Ic  is  possible  to  use  a 
higher  order  z,  but  che  added  complexity  will  negate 
che  advantages.) 

In  order  to  form  the  statistics  z,  we  need  to 
choose  the  matrices  A  and  3.  When  the  failure  signa¬ 
tures  under  consideration  are  constant  biases,  3  can 
simply  be  set  to  equal  Cq,  and  A  can  be  chosen  to  be 
al ,  where  0<a<l.  Then,  che  term  3r  in  (13)  resembles 
g’V~*r  of  (2),  and  it-  provides  the  correlation  of  the 
residual  with  the  signatures.  The  time  constant 
(1/1-a)  of  z  characterizes  the  memory  span  of  z  jusc 
as  ’»!  characterizes  chat  of  the  sliding  window  rules. 

More  generally,  if  we  consider  failure  signatures 
thac  are  not  constant  biases,  che  choice  of  A  may 
still  be  handled  in  the  same  way  as  in  the  constant- 
bias  case,  but  the  selection  of  a  8  matrix  is  more 
involved.  With  some  insights  inco  the  nature  of  che 
signatures,  a  reasonable  choice  of  3  can  often  be 
made.  To  illustrate  how  this  may  be  accomplished,  we 
will  consider  an  example  with  two  failure  modes  and  an 
m-dimensional  residual  vector.  Let 

gx(k-r)  -  Bj 

g2(k-r)  *  32(k-T+l) 

That  is,  g^  is  a  constanc  bias,  and  g.,  is  a  ramp.  If 
3.  and  3,  are  not  multiples  of  each  ocher  a  simple 
choice  of  3  is  available: 


L®2 

If  B.=a^3  and  3,“u,3,  where  and  o,  are  scalar  con¬ 
stants,  the  above  choice  of  3  has  rank  one  and  is  not 
useful  for  identifying  either  signature.  Suppose  we 
batch  process  every  two  residual  samples  cogecher,  i.e. 
ye  use  che  residual  sequences  r(k)-[r * (2k-l) , r ' (2k) 1 ’ , 
k-1,2,....  Then  we  can  set  3  to  be 


fs- 


Thtri ,  che  fir^c  and 
stanc-bias  and  ra-va  rue* 


■  i  rjvj  o: 
'  ®  L 
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(and  this  B  h3s  rank  two).  The  use  of  the  modified 
resudual  r(k)  in  this  case  causes  no  adverse  effect, 
since  it  only  lengthens  slightly  the  interval  between 
times  when  terminal  decisions  nay  be  made.  A  big  in¬ 
crease  in  such  intervals  i.e.,  the  batch  processing 
of  r(k) , . . . ,r(k+v)  simultaneously  for  large  v,  may 
however,  be  undesirable.  For  problems  where  the 
signatures  vary  drastically  as  a  function  of  the 
elapsed  time,  or  the  distinguishability  among  failures 
depends  essentially  on  these  variations,  the  effec¬ 
tiveness  of  using  z  diminishes.  In  such  cases  the 
sliding  window  decision  rule  should  provide  better 
performance  because  of  its  inherent  nature  to  look 
for  a  full  uindow's  worth  of  signature. 

Probability  Calculation 

An  algorithm  based  on  1-dimensional  Gaussian 
quadrature  formulas  [9]  has  been  developed  to  compute 
the  probability  integrals  of  (11)  and  (12)  for  the 
case  M-2.  (It  can  be  extended  to  higher  dimension 
with  an  increase  in  computation.)  The  details  of  this 
quadrature  algorithm  is  described  in  [1].  Its  accu¬ 
racy  has  been  assessed  via  comparison  with  Monte  Carlo 
simulations  (see  the  numerical  example).  With  this 
algorithm  we  can  evaluate  the  performance  probabili¬ 
ties  and  risks  associated  with  the  suboptimal  decision 
rules  described  above. 


Risk  Calculation 

In  the  absence  of  a  failure,  the  conditional 
density  has  been  observed  to  essentially  reach  a 
steady  state  at  some  finite  time  T>W.^-  Then,  for  k>T 
we  have 

Pr{t(k)cSj|So(k-l),0-}  -  S  (14) 

PrU(k)eS  ,i(k-l)eS0 . I(t)sS0|S(t-1) ,i,t}  - 

b^ (k-t ] i)  kx-r^T  (15) 


That  is,  once  steady  state  is  reached,  only  Che  rela¬ 
tive  time  (elapsed  time)  is  important.  Generally, 
fialures  occur  infrequently,  and  decision  rule  with 
low  false  alarm  probabilities  are  employed.  Thus,  it 
is  reasonalbe  to  assume  1)  p<<l  ((l-p)^=-  1),  and  2) 
Pr{SQ(T) |0,-}  =  1.  The  sequential  risk  associated 
with  (4)  for  M-2  can  be  approximated  by 


V  2  2 

U'(f)=P_L_+(l-P_)  £  c(i)£ 
f  i=l  j-1 

where 


Z  [c(i)C+L(i,j)]b.(t 
t-0  J 


i)  1 
(16) 


P  .  (l-oHl-bn) 

F  1-5  (l-o) 

Next,  we  seek  to  replace  the  infinite  sum  over  t 
in  (16)  by  Che  finite  sum  up  to  c-A  plus  a  terra  ap¬ 
proximating  the  remainder  of  the  infinite  sura.  Sup¬ 
pose  we  have  been  sampling  for  A  steps  since  the  fail¬ 
ure  occurred.  Define: 

Pt(jii)*Pr(l(c)eS  |50(c-l),i.O}  j-0,1,2 

If  we  stop  computing  che  probabilities  after  A,  we 
may  approximate 


^  Unfortunately ,  we  have  noc  been  able  to  prove 
such  convergence  behavior  using  eLemencary  techniques. 
More  advanced  function-theoretic  methods  nay  be  neces- 


Pt(j!i)=Pi(j!i)  j*C  ,  1 , 2  ,  t>A 

When  the  signature  of  che  failure  model  is  a  constant 
(including  che  no-fail  case),  che  reasoning  behind 
(14)  holds,  and  we  can  see  chat  ?c(j|i)  will  reach  a 
steady  state  value  as  t  (the  elaspsed  time)  increases. 
Then,  (17)  is  a  valid  approximation  for  a  large  A. 

For  the  case  where  failure  signatures  are  not  constants 
the  probability  of  continuing  after  a  tine  steps  (for 
sufficiently  large  A)  nay  be  arbitrarily  small.  The 
error  introduced  by  (17)  in  the  risk  (and  performance 
probability)  calculation  is,  consequently,  small. 
Substituting  (17)  in  (16),  we  get 

u  2  2 
U,"(f)=P_L_+(l-F_)i  a(i)  (c(i)t  +  £  l(i,  j)P(l,  j) )  (18) 

‘  i«l  j-1 

where 


Vji  j0C  yc|i)+b0(A|i)  6  ♦  1-p-joTI)- 

P(i’j)’tf0bi<t|i)+b°(i|i) 


(19) 

(20) 


?P  is  the  unconditional  false  alarm  probability,  i.e. 
Che  probability  of  one  false  alarm  over  all  tine,  t. 
is  the  conditional  expected  delay  to  decision,  given 
that  a  type  i  failure  has  occurred,  and  P(i,j)  is  the 
conditional  probability  of  declaring  a  type  j  failure, 
given  that  failure  i  has  occurred.  From  the  assumption 
that  Pr{5Q(T)  |0,-}«1  and  the  steady  condition  (14),  it 
can  be  shown  that  the- mean  time  between  false  alarms  is 
simply  (1-bg)-*.  Now  all  the  probabilities  in  (1S)- 
(20)  can  be  computed  by  using  the  quadrature  algorithm. 
Note  that  the  risk  expression  (18)  consists  only  of 
finite  sums  and  it  can  be  evaluated  with  a  reasonable 
amount  of  computational  effort.  With  such  an  approx¬ 
imation  of  the  sequential  risk,  we  will  be  able  to 
consider  the  problem  of  determining  the  decision 
regions  (the  thresholds  £)  that  minimize  the  risk. 

It  should  be  noted  that  we  could  consider  choosing 
a  set  of  thresholds  that  minimize  a  weighted  combina¬ 
tion  of  certain  detection  probabilities  (P(i,j)),  the 
expected  detection  delay  (t.),  and  the  mean  time  be¬ 
tween  false  alarms  ((1  -  b^-i) .  Although  such  an 
objective  function  will  not  result  in  a  Bayesian  de¬ 
sign  in  general,  it  is  a  valid  design  criterion  that 
may  be  useful  for  some  application. 


Risk  Minimization 

The  risk  minimization  problem  has  two  features 
Chat  deserve  special  attention.  Firstly,  the  sequen- 
tail  risk  is  noc  a  simple  function  of  the  threshold  f, 
and  the  derivative  with  respect  to  f  is  not  readily 
available.  Secondly,  calculating  the  risk  is  a  costly 
task.  Therefore,  che  minimum-seeking  procedure  Co  be 
used  must  require  few  function  (risk)  evaluations,  and 
it  must  not  require  derivatives.  The  sequence-of- 
quadratic-prograns  (SQ?)  algorithm  studied  by  Winfield 
[10]  has  been  chosen  to  solve  this  problem,  because  it 
does  not  need  any  derivative  information  and  it  appears 
to  require  fewer  function  evaluations  than  other  well- 
known  algorithms  [10].  Furthermore,  the  SOP  is  simple, 
and  it  has  quadratic  convergence.  Very  briefly,  che 
algorithm  consists  of  the  following.  At  each  iteration 
a  quadratic  surface  is  fitted  to  the  risk  function 
locally,  Chen  che  quadratic  model  is  minimized  over  a 
constraint  region  (hence  the  name  SQP).  The  risk 
function  is  evaluated  at  this  minimum  and  is  used  in 
the  surface  fitting  of  the  next  iteration.  The  de¬ 
tails  of  the  application  of  SQP  to  risk  minimization 
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is  reported  in  [1] . 

A.  NUMERICAL  EXAMPLE 


L(l,2)-L(2,l)-10 


L(l,l)-L(2,2)=0 


Here,  we  will  discuss  an  application  of  the  sub- 
optimal  rule  design  methodology  described  above  to  a 
numerical  example.  We  will  consider  the  detection 
and  identification  of  two  possible  failure  modes 
(without  identifying  the  failure  times).  We  assume 
that  the  residual  is  a  2-diaensional  vector,  and  the 
vector  failure  signatures,  g^(t),  i*l,2,  as  functions 
of  the  elapsed  time  t  are  shown  in  Table  1.  The 
signature  of  che  first  failure  mode  is  simply  a  con¬ 
stant  vector.  The  first  component  of  g2(c)  is  a  con¬ 
stant,  while  the  second  component  is  a  ramp.  We  have 
chosen  to  examine  these  two  types  of  signature  be¬ 
havior  (constanc  bias  and  ran?)  because  they  are  sim¬ 
ple  and  describe  a  large  variecy  of  failure  signatures 
that  are  commonly  seen  in  practice.  For  simplicity, 
we  have  chosen  V,  the  covariance  of  r,  to  be  the 
identity  matrix. 

We  will  design  both  a  simplified  sliding  window 
rule  (thac  uses  ly  ,)  and  3  rule  using  the  Markov 
statistic  r.  The  parameters  associated  with  the 
L,  , ,  i,  and  2  are  shown  in  Table  2,  and  the  cost 
functions  and  the  prior  probabilities  are  shown  in 
Table  3.  To  facilitate  discussions,  we  will  intro¬ 
duce  the  following  terminology.  We  will  refer  to  a 
Monce  Carlo  simulation  of  the  sliding  window  rule  by 
SW,  a  simulation  of  Che  rule  using  the  Markov  statis¬ 
tic  2  as  Markov  implementation  (MI),  and  a  simulation 
of  che  nonimplemencable  decision  process  using  the 
approximation  l  as  Markov  approximation  (MA) .  (All 
simulations  are  based  on  10,000  trajectones . )  The 
nocacion  Q20  refers  to  che  resulcs  of  applying  the 
quadrature  algorithm  to  the  approximation  of  by 
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Table  1.  Failure  signatures. 
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Table  3.  Cost  Functions  and  Prior  Probability. 

The  results  of  SW,  MA,  and  Q20  for  che  thresholds 
[S.85,  12.05]  are  shown  in  Figs.  2-6  (see  (15)  for  the 
definition  of  notations) .  The  quadrature  results  Q20 
are  very  close  to  MA,  indicating  good  accuracy  of  the 
quadrature  algorithm.  In  comparing  SW  with  MA,  it  is 
evident  thac  che  Markov  approximation  (MA)  slightly 
under-escimaces  the  false  alarm  race  of  che  sliding 
window  rule  (SW).  However,  Che  response  of  che  Markov 
approximation  to  failures  is  vary  close  to  thac  of  the 
sliding  window  rule.  In  the  present  example,  Ly  ,  is 
a  7-th  order  process,  while  its  approximation  i  is 
only  of  firsc  order.  In  view  of  this  face,  we  can 
conclude  that  l  provides  a  very  reasonable  and  useful 
approximation  of 

The  successive  choices  of  thresholds  by  SOP  for 
the  sliding  window  rule  are  plotted  in  Fig.  7.  Mote 
"that  we  have  not  carried  the  SO?  algorithm  far  enough 
so  thac  the  successive  choices  of  thresholds  are,  say, 
within  .001  of  each  other.  This  is  because  towards 
lacer  iterations  che  performance  indices  become  rela¬ 
tively  insensitive  to  small  changes  of  the  f’s.  This 
together  with  the  fact  that  we  are  only  computing  an 
approximate  Bayes  risk  means  that  fine  scale  opcimi- 
aacion  is  not  worthwhile.  Therefore,  with  the  approx¬ 
imate  risk,  the  SQP  is  cost  efficiently  used  to  locate 
the  zone  where  the  minimum  lies.  That  is,  the  SQP 
algorithm  is  to  be  terminated  when  it  is  evident  chat 
it  has  converged  into  a  reasonably  small, region.  Then 
we  nay  choose  the  thresholds  that  give  the  smallest 
risk  as  the  approximate  solution  of  the  minimisation. 

In  che  event  that  thresholds  thac  yield  che  small¬ 
est  risk  do  not  provide  che  desired  detection  perfor¬ 
mance,  the  design  parameters,  L,  c,  u,  and  W  may  be 
adjusted  and  che  SQP  may  be  repeated  to  get  a  new  de¬ 
sign.  A  practical  alternative  method  is  to  make  use 
of  the  list  of  performance  indices  (e.g.  P(i,j))  that 
are  generated  in  che  risk  calculation,  and  choose  a 
pair  of  thresholds  thac  yields  che  desired  performance. 

The  performance  of  the  decision  rules  using  Lj_^ 
and  z  as  determined  by  SQ?  are  shown  in  Figs.  S-12. 

(The  thresholds  for  are  (3.35,  12.05]  and  chose 

for  z  ere  [6.29,  11.69).)  We  note  chat  MI  has  a 
higher  false  alarm  rate  chan  SW.  The  speed  of  detec¬ 
tion  for  the  two  rules  is  similar.  While  MI  has  a 
slightly  higher  type-1  correcc  detection  probability 
chan  SW,  SW  has  a  consistently  higher  b2(c|2)  (type-2 
correct  detection  probability)  than  MI.  By  raising 
the  thresholds  of  the  rule  using  z  appropriately ,  we 
C3n  decrease  che  false  alarm  rate  of  MI  down  to  thac 
of  SW  with  an  increase  in  detaccior.  delay  and  slighclv 
improved  correcc  detection  probability  for  the  type-2 
failure  (with  ramp  signature).  Thus,  the  sliding 
window  rule  is  slightly  superior  to  the  rule  using  z 
in  the  sense  chat  when  both  are  designed  to  yield  a 
comparable  false  alarm  rate,  the  latter  will  have 
longer  detection  delays  and  slightly  lower  correct 
detection  probability  (for  tyre-2  failure).  In  view 
of  che  fact  thac  a  decision  rule  using  a  is  much 
simpler  to  implement,  it  is  worthy  of  being  considered 
ns  an  alternative  to  che  sliding  window  rule. 
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In  summary,  the  result  of  applying  our  decision 
rule  design  method  to  the  present  example  is  very 
good.  The  quadrature  algorithm  has  been  shown  to  be 
useful,  and  the  Markov  approximation  of  by  1  is 

a  valid  one.  The  SQP  algorithm  has  demonstrated  its 
simplicity  and  usefulness  through  the  numerical  exam¬ 
ple.  Finally,  Che  Markov  decision  statistic  z  has 
been  shown  to  be  a  worthy  alternative  to  che  sliding 
window  statistic  ly_^. 

5.  CONCLUSION 

A  mechodology  based  on  che  Bayesian  approach  is 
developed  for  designing  subopciraal  sequential  deci¬ 
sion  rules.  This  methodology  is  applied  to  a  numer¬ 
ical  example,  and  the  results  indicate  chat  it  is 
a  useful  design  approach. 
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Fig.l  Sequential  Decision  Regions  in  2  Dimensions 
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Fig. 2  b0(t/0)  -  SW,  MA,  and  Q20 


